Attorney Docket No. 9432-0001 72 
ENHANCED METHOD FOR DIGITAL DATA HIDING 

FIELD OF THE INVENTION 
[0001] The present invention relates generally to media data hiding, 
and more particularly, to a method for enhancing spread spectrum watermarking. 

BACKGROUND OF THE INVENTION 
. [0002] The proliferation of digitized media is creating a pressing need 
for techniques that protect copyright owners. Watermarking is a known 
technique for protecting copyright owners. Watermarking is a process of hiding 
information in digitized media for purposes of ownership verification and content 
protection. As expected, introduction of distortion into the host data during 
watermark embedding is unavoidable; nevertheless, it is usually favorable to 
have host data that is perceptually indistinguishable from its original form after 
ertibedding the watennark. 

[0003] One well known watermarking technique is based on spread 
spectrum communications. In spread spectrum communications, a narrowband 
signal is transmitted across a wider carrier frequency band such that the signal 
energy present in any single frequency is undetectable. Similarly, a watermark 
may be embedded in host media data by spreading the embedded data over a 
large frequency band. Spread spectrum watermark is designed to be difficult 
(preferably virtually impossible) to remove from the host signal without rendering 
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the host signal useless for its intended purposes and without imposing any 
perceptual artifacts introduced by the inserted watermark. 

[0004] In an article by Ingemar J. Cox et al., entitled "Secured 
Spectrum Watermarking for Multimedia" in IEEE Transactions on Image 
Processing, Vol. 6, No. 12, December 1997, as well as U.S. Patent No. 
5,930,369 by Cox et al, entitled "Secure spread spectrum watermarking for 
multimedia data", spread spectrum watermarking is described which embeds a 
pseudo-random noise sequence into digital data for watermarking purposes. 
Extraction of watermark requires the availability of the original image. In U.S. 
Patent No. 5,848,155 by Cox et al, entitled "Spread spectrum watermark for 
embedded signaling", the watermark is extracted from watermarked data without 
the use of an original or unwatermarked version of the data. This approach 
overcomes the limitations of the prior systems by using spread spectrum 
technology to embed watermark data or information into predetermined locations 
in an image. 



SUMMARY OF THE INVENTION 
[0005] In accordance with the present invention, an improved method 
is provided for spread spectrum watermarking. The improved watermarking 
method includes: receiving host data which is expressed in a transform domain 
by a plurality of host transform parameters; reducing variance between the 
plurality of host transform parameters, thereby forming an enhanced sequence of 
host transform parameters; and adding a watermark to the enhanced sequence 



of host transform parameters using a spread spectrum technique. The enhanced 
sequence of host transform parameters may be formed by sorting the plurality of 
host transform parameters in at least one of an ascending order or descending 
order; determining a difference for each pair of consecutive host transform 
parameters, and alternating the sign of every other difference value, thereby 
forming the enhanced sequence of host transform parameters. The present 
invention is applicable to the watermarking of various types of digital data stream, 
such as audio, Image video, software and multimedia. 
© [0006] For a more complete understanding of the invention, its objects 

13 and advantages, reference may be had to the following specification and to the 
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tj| accompanying drawings. 

|l BRIEF DESCRIPTION OF THE DRAWINGS 

ft! 

P [0007] Figure 1 is a flowchart of a method for enhanced spread 

fij spectrum watermarking in accordance with the present invention; and 

[0008] Figures 2A and 2B are block diagrams of an exemplary audio 

encoder and audio decoder, respectively, in accordance with the present 

invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0009] Spread spectrum watermarking is a well known technique for 
embedding watermark data into digitized media. Further explanation of this 
technique may be found in technical resources, including an article by Ingemar J. 
Cox et al entitled "Secure Spread Spectrum Watermarking for Multimedia" in 
IEEE Transactions on Image Processing, Vol. 6, No. 12, December 1997 which 
is incorporated by reference herein. However, a mathematical description of 
spread spectrum watermarking is provided below as a basis for understanding 
the present invention. 

[0010] Assume that we are going to embed a single bit fee {l ,-l} into a 
host signal sequence x = Xy X2_ •••,Xj^_ Further assume that k=kik2, •••,k^ is the 
watermark key which is randomly selected but known to the watermark detector, 
where g {l,-l}, 1 < i < and that each individual sample of the key is zero 
mean and independent to each other. Let A = Ai A2 ••-.A^ be the scaling 
sequence that determines the amount of distortion added to the host sequence, 
where A,.>0, !</ < A^, and finally the watermarked signal is x=x{,x2,---,x'^, 
where x' = x,. + l<i<N. 

[0011] To decode the watermark, we start with Jc,. =x{ + which is a 
noisy version of x^, where n = nj.nj,- is some additive noise sequence 

introduced in the watermarked signal processing, transformation, or intentional 
watermark attacks. Define 



m 



m 
m 



1 

then an estimate of the embedded information b can be formed as b= sign (D). 
If we assume that x, k, ^' and n are independent to each other and that 



E[A, ] = E[A], l<i<N, then D = E[D] = E 



al=E[iD-Dy] 



' I If 

— j;^,(^,+M,A:,+n,) 

. ^ ' 1=1 



= bE[A], and 



= E 



|:|;^,(x,+n,)^2^,(^,+n,) 



= 1^1] f,E[k,kj]E[ix,+n,)(Xj+nj)] 

^ i=l M 



_ j: n 
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[0012] Since D = bE[A] ^^^^ E[A] jg positive, ^ = sign (D) indeed 



D 



gives an appropriate estimate of * . Furthermore, the larger the value of 
the more certain the estimate will be. In the absence of noise. 



D 



E[AhfN 



a — . In the presence of noise h 



e\a\[n 



which is 



also proportional to 1/ . Hence, the smaller the host signal variance, the more 
accurate the abstraction of watermark, the more robust the watermarking 



scheme. 
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[0013] Figure 1 illustrates a method for enhancing spread spectrum 
watermarking of digitized media by reducing the variance of the host signal prior 
to adding the watermarlc. The host signal is commonly expressed in a transform 
domain by a plurality of host transform parameters. Although the frequency 
domain is presently preferred, it is envisioned that the present invention is also 
applicable to host data expressed in other spectral domains (e.g. cepstrum 
domain). 

[0014] Upon receiving the host signal at step 12, the host transform 
P parameters (e.g. frequency coefficients) may be sorted in an ascending at step 

''^'^ 14 to obtain ~ "^'^ It is to be understood that the host transform 



J'l parameters may also be sorted in a descending order. 

[0015] Next, an enhanced sequence of host transform parameters is 



formed by determining the difference for each pair of consecutive host transform 
parameters at step 16, and then alternating the sign of every other difference 
value at step 18. In other words, the enhanced sequence y = yi,y2,---,yy^, is 



derived in accordance with y j=i-iy(x^^^_^-x^^^), 1< j <N/^ (assuming N 



IS 



even). To ensure that y has an approximately zero mean, we explicitly make the 

two consecutive 's have alternative signs. Without the sign alternation, one 
skilled in the art will note that y will be the high pass band of the Haar wavelet 
transform of the sorted version of x. Since neighboring samples in the sorted 
sequence of host transform parameters are close in value, it is expected that the 
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sample values of y will be much smaller than those of x. Hence, in 
general. It is to be understood that there are many other ways to modify Xs2n-i 
and Xs2n to get . In each application, It is desirable to adjust Xs2n-i and Xs2n in 
the way that minimizes perceptual alteration introduced via watermark 
embedding and preserves the order of Xsan-i and Xs2n- For instance, a 
perceptual model, such as the JND (Just Noticeable Difference) function used in 
JPEG compression, may be used to guide it. 

[0016] Lastly, a watermark is added to the enhanced sequence of host 
transform parameters at step 20 using well known spread spectrum techniques. 
For example, let the watermark key be V.=k^,k^,--k^, ,k^^\^\^, and the 

embedding bit be ^- That is, ^ ^^KK- one skilled in the art will readily 
recognize that this technique may be extended to embed a plurality of bits into 
the host signal. As in ordinary spread spectrum technique, A„ controls the 
amount of distortion that can be added to y„ which should be guided by 
perceptual model if the host media data is perceptual data. Since 

-(-1) ti^Q perturbation ^^'^'^^ can be achieved by applying the 

distortion to either ^"^-^ or ^*^» or both. This extra flexibility is useful because 

^*2«-> and ^^2" may have unequal susceptibility to noise. For simplicity, we split 

the distortion evenly between the pair. Thus we get ^« ^ ^« '^^^nK as desired, if 
we construct 



[0017] To decode the watermark, we first form an estimate 
y = 5'i,3'2'""3'a^ with the help of the sorting indices s^s^-'-Si^. The embedded bit 

b is then the sign of ^^(^l^k^y^, as in an ordinary spread spectrum watermark. 

During decoding, it is understood that the sorting indices and the watermark key 
are required. Instead of transferring them separately, it is envisioned that the 
sorting indices and the watermark can be combined into a single signal 
dependent key which is in turn transmitted to the decoder. 

[0018] Compression and transcoding of digital media are often needed 
for various applications, including delivery through a different communication 
channel of different bandwidth and use in different devices. For example, an 
audio is compressed using MPEG II AAC schemes or a compressed audio in 
MPEG II AAC format is transcoded to MPEG I layer 3, MPS, format. Such 
reconstruction of the media signal via compression or transcoding can be lossy. 
In general, this does not alter the signal perceptibility. In many applications, it is 
often desirable that the embedded watermark survives such lossy reconstruction 
as well as other types of lossy signal processing and it should remain detectable 
within the reconstructed media signal. 

[0019] Figure 2 illustrates how the enhanced spread spectrum 

watermarking technique of the present invention may be applied to a 

compressed audio signal. For illustration purposes, the compressed audio data 
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is compressed in accordance with the well known Advanced Audio Coding (AAC) 
compression standard. While the following description is provided with reference 
to compressed audio data, it is readily understood that the present invention is 
applicable to other types of media data and other types of digital data streams, 
including image data, video data, multimedia data and software data. 

[0020] In the exemplary encoder 30, the compressed audio data 32 is 
input into a partial decoder 34. The partial decoder is in turn operable to partially 
decode the compressed data to recover a plurality of quantization indices. 
Rather than embed a watermark into frequency coefficients, the watermark will 
be embedded into the quantization indices. Since no dequantization or 
requantization is necessary, this approach increases the speed of the overall 
process. However, it is envisioned that embedding a watermark into the 
frequency coefficients is also within the scope of the present invention. 

[0021] The quantization indices from the compressed audio data are 
then input into an index selector 36. The Index selector 36 applies perceptual 
modeling to estimate the maximum amount of distortion that can be withstood by 
each quantization index. Perceptual modeling is generally known to those in the 
art. However, perceptual modeling information is not easily accessible from 
compressed audio, and thus a heuristic approach may be used to apply 
perceptual modeling to quantization indices. Further implementation details for 
an exemplary heuristic approach are disclosed in U.S. Patent application entitled 
"Method and Apparatus for Embedding Data in Compressed Audio Data Stream" 
filed on February 25, 2002 by the assignee of the present invention which is 



incorporated herein by reference. In any event, selected quantization indices are 
output by the index selector 36. 

[0022] Prior to adding a watermark, the selected quantization indices 
may be enhanced as described above. To do so, the selected quantization 
indices are sorted in either of an ascending order or descending order by the 
index sorter 38. The sorted quantization indices then serve as an input to an 
enhanced index sequencer 40. The enhanced index sequencer 40 computes a 
difference for each pair of consecutive quantization indices, and alternates the 
sign of every other quantization index, thereby forming an enhanced sequence of 
quantization indices. 

[0023] A watermark encoder 42 adds a watermark to the enhanced 
sequence of quantization indices using well known spread spectrum techniques. 
In addition to the enhanced sequence of quantization Indices, the watermark 
encoder 40 receives as input watermark data and watermark key data. Lastly, 
the watermarked quantization indices are compressed with a Hoffman coding by 
a partial encoder 46 to form a compressed watermarked audio signal. 

[0024] The exemplary audio encoder 30 may optionally include a key 
multiplexer 44. As noted above, the decoding process requires the watermark 
key as well as the sorting indices from the index sorter 38. The key multiplexer 
44 combines watermark key with the sorting indices to form a single decoding 
key. A corresponding audio decoder 50 is depicted in Figure 2B. 

[0025] The foregoing discloses and describes merely exemplary 
embodiments of the present invention. One skilled in the art will readily 

10 



recognize from such discussion, and from accompanying drawings and claims, 
that various changes, modifications, and variations can be made therein without 
departing from the spirit and scope of the present invention. 
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